3574 results found.
Multimodal/Multimedia
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
100M entries
Production Status:
Existing-used
Use:
Semantic Similarity
- Paper title: Social Image Tags as a Source of Word Embeddings: A Task-oriented Evaluation
- Paper track: Evaluation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Mika Hasegawa | Waseda University | JP |
| Author 2 | Tetsunori Kobayashi | Waseda University | JP |
| Author 3 | Yoshihiko Hayashi | Waseda University | JP |
| Main Contact | Mika Hasegawa | Waseda University | None |
Documentation:
This dataset contains a list of photos and videos. This list is compiled from data available on Yahoo! Flickr. All the photos and videos provided in the list are licensed under one of the Creative Commons copyright licenses, and as such they can be used for benchmarking purposes as long as the photographer/videographer is credited for the original creation.
Written
Annotation Tool,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Corpus Creation/Annotation
- Paper title: SACR: A Drag-and-Drop Based Tool for Coreference Annotation
- Paper track: Written
- Paper status: Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Bruno Oberle | University of Strasbourg | FR |
| Main Contact | Bruno Oberle | University of Strasbourg | None |
Documentation:
<Not Specified>
Written
Software Toolkit,
Language Type:
Multilingual
Languages:
English, Standard Arabic
Availability:
Freely Available
License:
OpenSource
Size:
150 MByte
Production Status:
Newly created-finished
Use:
Discourse
- Paper title: OSMAN – A Novel Arabic Readability Metric
- Paper track: Evaluation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Mahmoud El-Haj | Lancaster University | GB |
| Author 2 | Paul Rayson | Lancaster University | GB |
| Main Contact | Mahmoud El-Haj | Lancaster University | None |
Documentation:
OSMAN Arabic Readability Metrics
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Document Annotations Released by Web Interface
License:
<Not Specified>
Size:
10700 sentences
Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
- Paper title: A Multi-Layered Annotated Corpus of Scientific Papers
- Paper track: Evaluation
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Beatriz Fisas | TALN Research Group, Pompeu Fabra University | ES |
| Author 2 | Francesco Ronzano | Natural Language Processing Group (TALN), Universitat Pompeu Fabra, Barcelona | ES |
| Author 3 | Horacio Saggion | Universitat Pompeu Fabra | ES |
| Main Contact | Beatriz Fisas | TALN Research Group, Pompeu Fabra University | None |
Documentation:
to be released
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
LDC
Size:
65000 <Not Specified>
Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
- Paper title: TIMEN: An Open Temporal Expression Normalisation Resource
- Paper track: Written
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Hector Llorens | <Not Specified> | None |
| Author 2 | Leon Derczynski | <Not Specified> | None |
| Author 3 | Robert Gaizauskas | <Not Specified> | None |
| Author 4 | Estela Saquete | <Not Specified> | None |
| Main Contact | Leon Derczynski | University of Sheffield | GB |
Documentation:
<Not Specified>
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Word Sense Disambiguation
- Paper title: UFSAC: Unification of Sense Annotated Corpora and Tools
- Paper track: Written
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Loïc Vial | LIG | FR |
| Author 2 | Benjamin Lecouteux | LIG | FR |
| Author 3 | Didier Schwab | Univ. Grenoble Alpes | FR |
| Main Contact | Loïc Vial | LIG | None |
Documentation:
<Not Specified>
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
1 MByte
Production Status:
Newly created-finished
Use:
Document Classification, Text categorisation
- Paper title: Modeling Stance in Student Essays
- Paper track: Empirical/Data-Driven
- Paper status: Accept - Submitfinal
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Isaac Persing | University of Texas at Dallas | US |
| Author 2 | Vincent Ng | University of Texas at Dallas | US |
| Main Contact | Isaac Persing | University of Texas at Dallas | None |
Documentation:
<Not Specified>
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
<Not Specified>
Size:
15,553 tweets
Production Status:
Newly created-in progress
Use:
Emotion Recognition/Generation
- Paper title: EmoTweet-28: A Fine-Grained Emotion Corpus for Sentiment Analysis
- Paper track: Written
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Jasy Suet Yan Liew | School of Information Studies, Syracuse University | US |
| Author 2 | Howard R. Turtle | School of Information Studies, Syracuse University | US |
| Author 3 | Elizabeth D. Liddy | School of Information Studies, Syracuse University | US |
| Main Contact | Jasy Suet Yan Liew | School of Information Studies, Syracuse University | None |
Documentation:
<Not Specified>
Not Applicable
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
CreativeCommons
Size:
1.7 MByte
Production Status:
Newly created-finished
Use:
Named Entity Recognition (NER)
- Paper title: Location Name Extraction from Targeted Text Streams using Gazetteer-based Statistical Language Models
- Paper track: Computationally-aided linguistic analysis
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Hussein Al-Olimat | Kno.e.sis, Wright State University | US |
| Author 2 | Krishnaprasad Thirunarayan | Kno.e.sis, Wright State University | N/A |
| Author 3 | Valerie Shalin | Kno.e.sis, Wright State University | N/A |
| Author 4 | Amit Sheth | Kno.e.sis, Wright State University | N/A |
| Main Contact | Hussein Al-Olimat | Kno.e.sis, Wright State University | None |
Documentation:
Yes, available with the data
Written
Evaluation Data,
Language Type:
Multilingual
Languages:
Czech, English
Availability:
<Not Specified>
License:
open
Size:
English: 5327 collocations for 102 headwords; Czech: 1378 collocations for 85 headwords
Production Status:
Newly created-finished
Use:
Evaluation of corpora or various NLP tools
- Paper title: Extrinsic Corpus Evaluation with a Collocation Dictionary Task
- Paper track: Evaluation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Adam Kilgarriff | Lexical Computing Ltd | GB |
| Author 2 | Pavel Rychlý | NLP Centre, Faculty of Informatics, Masaryk University, Brno, Czech Republic | CZ |
| Author 3 | Miloš Jakubíček | NLP Centre, Faculty of Informatics, Masaryk University, Brno, Czech Republic | CZ |
| Author 4 | Vojtěch Kovář | NLP Centre, Faculty of Informatics, Masaryk University, Brno, Czech Republic | CZ |
| Author 5 | Vit Baisa | NLP Centre, Faculty of Informatics, Masaryk University, Brno, Czech Republic | CZ |
| Author 6 | Lucia Kocincová | NLP Centre, Faculty of Informatics, Masaryk University, Brno, Czech Republic | CZ |
| Main Contact | Adam Kilgarriff | Lexical Computing Ltd | None |
Documentation:
The paper is the documentation




